Minimum Description Length Recurrent Neural Networks

نویسندگان

چکیده

Abstract We train neural networks to optimize a Minimum Description Length score, that is, balance between the complexity of network and its accuracy at task. show optimizing this objective function master tasks involving memory challenges go beyond context-free languages. These learners languages such as anbn, anbncn, anb2n, anbmcn +m, they perform addition. Moreover, often do so with 100% accuracy. The are small, their inner workings transparent. thus provide formal proofs perfect holds not only on given test set, but for any input sequence. To our knowledge, no other connectionist model has been shown capture underlying grammars these in full generality.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Temporal Minimum Description Length Policy for Evolving Neural Networks

One of the most important issues for computational methods is their time complexity. This paper introduces a temporal MDL (minimum description length) policy for evolving neural networks based on their execution time on the hosting hardware. Temporal MDL implements an adaptive selection pressure based on the actual processing time of the evolving solutions and thus favors creation of faster, mo...

متن کامل

Information Geometry and Minimum Description Length Networks

We study parametric unsupervised mixture learning. We measure the loss of intrinsic information from the observations to complex mixture models, and then to simple mixture models. We present a geometric picture, where all these representations are regarded as free points in the space of probability distributions. Based on minimum description length, we derive a simple geometric principle to lea...

متن کامل

Minimum Translation Modeling with Recurrent Neural Networks

We introduce recurrent neural networkbased Minimum Translation Unit (MTU) models which make predictions based on an unbounded history of previous bilingual contexts. Traditional back-off n-gram models suffer under the sparse nature of MTUs which makes estimation of highorder sequence models challenging. We tackle the sparsity problem by modeling MTUs both as bags-of-words and as a sequence of i...

متن کامل

Minimum Description Length Criterion

he intelligibility of speech in communication systems is generally reduced by interfering noise. This interference, which can take the form of environmental noise, reverberation, competing speech, or electronic channel noise, reduces intelligibility by masking the signal of interest. The reduction in intelligibility is particularly troublesome for listeners with hearing impairments, who have gr...

متن کامل

Minimum Description Length Principle

The minimum description length (MDL) principle states that one should prefer the model that yields the shortest description of the data when the complexity of the model itself is also accounted for. MDL provides a versatile approach to statistical modeling. It is applicable to model selection and regularization. Modern versions of MDL lead to robust methods that are well suited for choosing an ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Transactions of the Association for Computational Linguistics

سال: 2022

ISSN: ['2307-387X']

DOI: https://doi.org/10.1162/tacl_a_00489